A survey in indexing and searching XML documents

نویسندگان

  • Robert Wing Pong Luk
  • Hong Va Leong
  • Tharam S. Dillon
  • Alvin T. S. Chan
  • W. Bruce Croft
  • James Allan
چکیده

Introduction This is my personal “summary in 337 one-liners” of A Survey in Indexing and Searching XML Documents by Luk et al. (2002) [1]. I focus on technical aspects, omitting all system names and references. In my opinion, one cannot learn any technique from the survey: it only mentions various techniques but does not explain any. Alas, my 337 one-liners are even less informative. The survey itself can be used as a test whether you already knew all the things that it mentions, and the classification that it gives. Further, the survey is useful as a rather complete collection of references to the literature (and this aspect is completely omitted in this summary). I am impressed by the apparent completeness of the survey, but I consider most of it badly written (unclear, incomprehensible and in bad English).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Indexing and Searching XML Documents Based on Content and Structure Synopses

We present a novel framework for indexing and searching schema-less XML documents based on concise summaries of their structural and textual content. Our search query language is XPath extended with full-text search. We introduce two novel data synopsis structures that correlate textual with positional information in an XML document and improves query precision. In addition, we present a two-ph...

متن کامل

خوشه‌بندی فراابتکاری اسناد فارسی اِکس‌اِم‌اِل مبتنی بر شباهت ساختاری و محتوایی

Due to the increasing number of documents, XML, effectively organize these documents in order to retrieve useful information from them is essential. A possible solution is performed on the clustering of XML documents in order to discover knowledge. Clustering XML documents is a key issue of how to measure the similarity between XML documents. Conventional clustering of text documents using a do...

متن کامل

Indexing XML Objects with Ordered Schema Trees

XML DBMSs require new indexing techniques to efficiently process structural search and full-text search as integrated in XQuery. Much research has been done for indexing XML documents. In this paper we first survey some of them and suggest a classification scheme. It appears that most techniques are indexing on paths in XML documents and maintain a separated index on values. In some cases, the ...

متن کامل

Flexible Querying of XML Documents

Text search engines are inadequate for indexing and searching XML documents because they ignore metadata and aggregation structure implicit in the XML documents. On the other hand, the query languages supported by specialized XML search engines are very complex. In this paper, we present a simple yet flexible query language, and develop its semantics to enable intuitively appealing extraction o...

متن کامل

Searching XML Documents - Preliminary Work

Structured document retrieval aims at exploiting the structure together with the content of documents to improve retrieval results. Several aspects of traditional information retrieval applied on flat documents have to be reconsidered. These include in particular, document representation, storage, indexing, retrieval, and ranking. This paper outlines the architecture of our system and the adapt...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JASIST

دوره 53  شماره 

صفحات  -

تاریخ انتشار 2002